AITopics | precise localization

Collaborating Authors

precise localization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Road Damage and Manhole Detection using Deep Learning for Smart Cities: A Polygonal Annotation Approach

Hossen, Rasel, Mistry, Diptajoy, Rahman, Mushiur, Hridoy, Waki As Sami Atikur Rahman, Saha, Sajib, Ibrahim, Muhammad

arXiv.org Artificial IntelligenceOct-7-2025

Urban safety and infrastructure maintenance are critical components of smart city development. Manual monitoring of road damages is time-consuming, highly costly, and error-prone. This paper presents a deep learning approach for automated road damage and manhole detection using the YOLOv9 algorithm with polygonal annotations. Unlike traditional bounding box annotation, we employ polygonal annotations for more precise localization of road defects. We develop a novel dataset comprising more than one thousand images which are mostly collected from Dhaka, Bangladesh. This dataset is used to train a YOLO-based model for three classes, namely Broken, Not Broken, and Manhole. We achieve 78.1% overall image-level accuracy. The YOLOv9 model demonstrates strong performance for Broken (86.7% F1-score) and Not Broken (89.2% F1-score) classes, with challenges in Manhole detection (18.2% F1-score) due to class imbalance. Our approach offers an efficient and scalable solution for monitoring urban infrastructure in developing countries.

artificial intelligence, detection, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2510.03797

Country: Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.26)

Genre: Research Report (0.64)

Industry: Transportation > Ground > Road (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

No Need to Look! Locating and Grasping Objects by a Robot Arm Covered with Sensitive Skin

Bartunek, Karel, Rustler, Lukas, Hoffmann, Matej

arXiv.org Artificial IntelligenceSep-12-2025

This work has been submitted to the IEEE for possible publication. No Need to Look! Locating and Grasping Objects by a Robot Arm Covered with Sensitive Skin Abstract-- Locating and grasping of objects by robots is typically performed using visual sensors. Haptic feedback from contacts with the environment is only secondary if present at all. The main novelty lies in the use of contacts over the complete surface of a robot manipulator covered with sensitive skin. The search is divided into two phases: (1) coarse workspace exploration with the complete robot surface, followed by (2) precise localization using the end-effector equipped with a force/torque sensor . We systematically evaluated this method in simulation and on the real robot, demonstrating that diverse objects can be located, grasped, and put in a basket. The overall success rate on the real robot for one object was 85.7% with failures mainly while grasping specific objects. The method using whole-body contacts is six times faster compared to a baseline that uses haptic feedback only on the end-effector . We also show locating and grasping multiple objects on the table. This method is not restricted to our specific setup and can be deployed on any platform with the ability of sensing contacts over the entire body surface. This work holds promise for diverse applications in areas with challenging visual perception (due to lighting, dust, smoke, occlusion) such as in agriculture when fruits or vegetables need to be located inside foliage and picked. Perception for robot manipulation has been dominated by visual inputs from cameras (RGB) or depth cameras (RGB-D). Classical methods have been used for object segmentation and pose and shape estimation to feed the synthesis of grasp proposals for a robot hand (e.g., [1]).

artificial intelligence, experiment, simulation, (15 more...)

arXiv.org Artificial Intelligence

2508.17986

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (1.00)

Add feedback

Augmented Reality without Borders: Achieving Precise Localization Without Maps

Puigjaner, Albert Gassol, Aloise, Irvin, Schmuck, Patrik

arXiv.org Artificial IntelligenceSep-4-2024

Visual localization is crucial for Computer Vision and Augmented Reality (AR) applications, where determining the camera or device's position and orientation is essential to accurately interact with the physical environment. Traditional methods rely on detailed 3D maps constructed using Structure from Motion (SfM) or Simultaneous Localization and Mapping (SLAM), which is computationally expensive and impractical for dynamic or large-scale environments. We introduce MARLoc, a novel localization framework for AR applications that uses known relative transformations within image sequences to perform intra-sequence triangulation, generating 3D-2D correspondences for pose estimation and refinement. MARLoc eliminates the need for pre-built SfM maps, providing accurate and efficient localization suitable for dynamic outdoor environments. Evaluation with benchmark datasets and real-world experiments demonstrates MARLoc's state-of-the-art performance and robustness. By integrating MARLoc into an AR device, we highlight its capability to achieve precise localization in real-world outdoor scenarios, showcasing its practical effectiveness and potential to enhance visual localization in AR applications.

augmented reality, border, precise localization

arXiv.org Artificial Intelligence

2408.17373

Genre: Research Report (0.69)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.60)
Information Technology > Geographic Information Systems (0.53)
Information Technology > Artificial Intelligence > Vision (0.53)

Add feedback

Precise localization within the GI tract by combining classification of CNNs and time-series analysis of HMMs

Werner, Julia, Gerum, Christoph, Reiber, Moritz, Nick, Jörg, Bringmann, Oliver

arXiv.org Artificial IntelligenceOct-11-2023

This paper presents a method to efficiently classify the gastroenterologic section of images derived from Video Capsule Endoscopy (VCE) studies by exploring the combination of a Convolutional Neural Network (CNN) for classification with the time-series analysis properties of a Hidden Markov Model (HMM). It is demonstrated that successive time-series analysis identifies and corrects errors in the CNN output. Our approach achieves an accuracy of $98.04\%$ on the Rhode Island (RI) Gastroenterology dataset. This allows for precise localization within the gastrointestinal (GI) tract while requiring only approximately 1M parameters and thus, provides a method suitable for low power devices

classification, cnn and time-series analysis, precise localization, (2 more...)

arXiv.org Artificial Intelligence

2310.07895

Country: North America > United States > Rhode Island (0.24)

Genre: Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area > Gastroenterology (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Time Series Analysis (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)

Add feedback